A Combination of Hypercolumn Model with Hidden Markov Model for Japanese Lip-Reading System

Authors

  • Alaa Sagheer
  • Naoyuki Tsuruta
  • Rin-Ichiro Taniguchi
  • Sakashi Maeda
  • Seiji Hashimoto
Abstract

In recent years, lip-reading systems have received much attention because they play an important role in human communication with computers, especially for hearing-impaired and elderly people. In this paper, we introduce a novel Japanese lip-reading system that combines the Hypercolumn Neural Network model (HCM) with the Hidden Markov Model (HMM). In this system, we use the HCM to extract visual speech features from the input image. The extracted features are modeled by Gaussian distributions, which are used in the recognition phase by the HMM. The proposed lip-reading system can work under varying lip positions and sizes. Our experiments were carried out using multiple sentences in Japanese. All images were captured in a natural environment without special lighting or lip markers. Experimental results demonstrate that the proposed system achieves high recognition performance.
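The recognition step described in the abstract (Gaussian-modeled features scored by an HMM) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes diagonal-covariance Gaussians, one HMM per candidate utterance, and toy two-dimensional feature vectors standing in for the HCM output, which is not reproduced here. All function names, model labels, and numbers below are hypothetical.

```python
import math

def log_gaussian(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def logsumexp(values):
    """Numerically stable log(sum(exp(v))) that tolerates -inf entries."""
    top = max(values)
    if top == -math.inf:
        return -math.inf
    return top + math.log(sum(math.exp(v - top) for v in values))

def safe_log(p):
    """log(p), with log(0) mapped to -inf instead of raising."""
    return math.log(p) if p > 0.0 else -math.inf

def forward_log_likelihood(frames, start, trans, means, variances):
    """Forward algorithm in log space for one Gaussian-emission HMM."""
    n = len(start)
    alpha = [
        safe_log(start[s]) + log_gaussian(frames[0], means[s], variances[s])
        for s in range(n)
    ]
    for x in frames[1:]:
        alpha = [
            logsumexp([alpha[r] + safe_log(trans[r][s]) for r in range(n)])
            + log_gaussian(x, means[s], variances[s])
            for s in range(n)
        ]
    return logsumexp(alpha)

def recognize(frames, models):
    """Return the label whose HMM assigns the sequence the highest likelihood."""
    return max(
        models,
        key=lambda label: forward_log_likelihood(frames, *models[label]),
    )

# Toy left-to-right models: (start probs, transition matrix, means, variances).
# These values are invented for illustration only.
models = {
    "low_to_high": ([1.0, 0.0], [[0.5, 0.5], [0.0, 1.0]],
                    [[0.0, 0.0], [1.0, 1.0]], [[0.1, 0.1], [0.1, 0.1]]),
    "high_to_low": ([1.0, 0.0], [[0.5, 0.5], [0.0, 1.0]],
                    [[1.0, 1.0], [0.0, 0.0]], [[0.1, 0.1], [0.1, 0.1]]),
}

# A hypothetical feature sequence that drifts from near (0, 0) to near (1, 1).
frames = [[0.0, 0.1], [0.1, 0.0], [0.9, 1.0], [1.0, 1.1]]
best_label = recognize(frames, models)
```

Working in log space avoids numerical underflow on long frame sequences, which is why the sketch sums log probabilities rather than multiplying raw ones.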


Similar resources

Intrusion Detection Using Evolutionary Hidden Markov Model

Intrusion detection systems are responsible for diagnosing and detecting any unauthorized use, exploitation, or destruction of the system, and can prevent cyber-attacks through network packet analysis. One of the major challenges in using these tools is the lack of training patterns of attacks for the analysis engine; engine failure that caused the complete training,  ...


Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a maximum a posteriori (MAP) estimator based on a Laplace-Gaussian (for clean speech and noise, respectively) combination in the HMM ...


Efficient face model for lip reading

There have been a number of studies on lip reading. However, there is little discussion of which face model is effective for lip reading. This paper builds various face models that vary the combination of face parts and the feature points. Various experiments were conducted under conditions that change only the model and leave the other algorithms unchanged. We apply the active appearance m...


MAN-MACHINE INTERACTION SYSTEM FOR SUBJECT INDEPENDENT SIGN LANGUAGE RECOGNITION USING FUZZY HIDDEN MARKOV MODEL

Sign language recognition (SLR) has attracted growing interest in the human–computer interaction community. The major challenge that SLR now faces is developing methods that scale well with increasing vocabulary size given a limited set of training data for signer-independent applications. Automatic SLR based on hidden Markov models (HMMs) is very sensitive to gesture's shape inf...


Visual Recognition System for Hearically Impaired Person –A Review

This paper gives an overview of lip reading. Image processing is generally used to process an image for different applications. There is a variety of transform-based feature extraction methods. Visual recognition systems, or lip-reading methods, are important mainly in noisy conditions. A new modality in the image processing area provides dictation of voice. Keywords: ANN (Artificial Neural Network),...




Publication date: 2004